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^ This invention relates to a novel series of proteins which 
exhibit procoagulant properties . These proteins have marked 
structural differences from human factor VIII :C, but have 
5 similar procoagulant activity. 

Factor VIII :C is the blood plasma protein that is defective 
or absent in Hemophilia A disease. This disease is a hereditary 
bleeding disorder affecting approximately one in 20,000 males. 
lOThe structure of factor VIII :C is described in U.S. Patent Appli- 
cations Serial No. 546,650 filed ^ctob^ >28 / 1983 and No. 644,036 
filed ^ugus ^)24, 1984, which are incorporated herein by reference 
and in Nature, 312:306, 307, 326 and 342. 
/ 

150ne of the problems presently encountered with the use of human 
factor VIII :c for treatment of hemophilia arises from its anti- 
genicity. A significant percentage of hemophiliacs have 
developed an immune reaction to the factor VIII :C used for 
their treatment. Non-hemophiliacs can also develop or acquire 

20 hemophilia when their immune systems become sensitized to 
factor VIII :C and produce circulating antibodies or "inhibitors" 
to factor VIII :C. In either case, the effect is the neutrali- 
zation of whatever factor VIII :C is present in the patient, 
making treatment very difficult. Until now, the method of 

25choice for treating hemophiliacs with this problem has been to 
administer, in cases of severe bleeding episodes, non-human 
factor VIII :c, such as treated porcine factor VIII :C. See 
Kernoff et al., Blood 63:31 (1984). However, the antibodies 
which neutralize the clotting ability of human factor VIII:C 

30will react to a varying extent with factor VIII :C of other 
species, and the porcine protein is itself antigenic, thus 
both the short-term and long-term effectiveness of such treat- 
ment will vary. 
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Additionally, patients frequently display adverse reactions to 
infusion with the porcine factor VIII :C. The use of porcine 
factor VIII :C in spite of the risks has been justified because 
of the lack of reliably effective alternatives. Kernoff, 
supra at 38, The present invention provides an alternative to 
the administration of porcine factor VIII :C. 

This invention provides for proteins which have procoagulant 
activity similar to that of factor VIII :C and also have substant- 
ially lower molecular weight. These proteins are schematically 
depicted by formula (1) as follows: ^ 



ft 




wherein A represents a polypeptide sequence substantially dupli- 
cative of the sequence Ala-20 through Arg-759; B represents a 
polypeptide sequence substantially duplicative of the sequence 
Ser-1709 through the C-terminal Tyr-2351; and X represents a 
polypeptide sequence of up to 949 amino acids substantially 
20 duplicative of sequences of amino acids within the sequence 
Ser-760 through Arg-1708. The amino terminus of region X is 
covalently bonded through a peptide bond (designated in 
formula 1) to the carboxy terminus of A. The carboxy terminus 
of region X is likewise bonded to the amino terminus of B. 
25 Numbering of amino acids throughout this disclosure is with 
reference to the numbering of amino acids in Table 1 in which 
the first amino acid, Met, of the leader sequence is assigned 
Number 1. Protein domain X may comprise a continuous but 
shorter sequence selected from the region Ser-760 - through 
30 Arg-1708. Alternatively X may comprise two or more amino acid 
sequences selected from that region which are covalently 
bonded by a peptide bond (maintaining an ascending numerical 
order of amino acids) . 

^/^By way of example, one compound of this invention contains a 
I region X comprising the amino acid sequence of. Ser-760 to Pro- 
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TABLE 1 



5 1 CAATTCCCCACTCCCT/^CTTCCTTAAATCCTCTGCAAACAAATTCCCACTTTTCATTAAATCACAAATT 
TTACT U11J CCCCTCCTCCCAGCTAAACATATTTTAGACAACAATTAACCTrrTCCTTCT^ 



MET 
ATC 


Cln 
CAA 


U« 

ATA 


Clu 
CAC 


Leu 

CTC 


Ser 

TCC 


Thr 
ACC 


Cy. 

TCC 


Phe 

TTC 


Phe 

TTT 


Leu 

CTC 


Cys 
TCC 


Leu 
, CTT 


Leu 

TTC 


Arg 
CCA 


Phe 

TTC 


Cys 
TCC 


Phe 

TTT 


18 


Ser 
ACT 


Ala ' 
CCC 


w Thr 

/acc 


Arg 
ACA 


Arg 
ACA 


Tyr 
TAC 


Tyr 
TAC 


Leu 

CTC 


Cly 
CCT 


Ala 
CCA 


Val 
CTC 


Clu 
CAA 


Leu 
CTC 


Ser 
TCA 


Trp 
TCC 


Asp 

CAC 


Tyr 
TAT 


HET 
ATC 


36 


Cln 
CAA 


Ser 
ACT 


Asp 
GAT 


Leu 

CTC 


Cly 
CCT 


Clu 
CAC 


Leu 

CTC 


Pro 
CCT 


Val 

CTC 


Asp 
CAC 


Ala 

CCA 


Arg 
ACA 


Phe 

TTT 


Pro 
CCT 


Pro 
CCT 


Arg 
ACA 


Val 
CTC 


Pro 
CCA 


54 


Lys 

AAA 


Ser 

TCT 


Phe 

TTT 


Pro 
CCA 


Phe 

TTC 


Aen 

AAC 


Thr 
ACC 


Ser 

TCA 


Val 

CTC 


Val 

CTC 


Tyr 
TAC 


Ly« 
AAA 


Lys 

AAC 


Thr 
ACT 


Leu 
CTC 


Phe 

TTT 


Val 

CTA 


Clu 
CAA 


72 


Phe 
TTC 


Thr 
ACC 


Val 

CTT 


Hla 
CAC 


Leu 
CTT 


Phe 

TTC 


Asn 
AAC 


He 
ATC 


Ala 
CCT 


Lys 
AAC 


Pro 
CCA 


Arg 

ACC 


Pro 
CCA 


Pro 
CCC 


Trp 
TCC 


MET 
ATC 


Cly 
CCT 


Leu 
CTC 


90 


Leu 
CTTA 


Cly 
CCT 


Pro 
CCT 


Thr 
ACC 


lie 

ATC 


Cln 
CAC 


Ala 

CCT 


Clu 
CAC 


Val 
CTT 


Tyr 

. TAT 


Asp 

CAT 


Thr 
ACA 


.Val 

CTC 


Val 

CTC 


lie 

ATT 


Thr 
ACA 


Leu 

CTT 


Lys 
AAC 


108 


Asn 

AAC 


MET 
ATC 


Ala 

CCT 


Ser 

TCC 


Hla 

CAT 


Pro 
CCT 


Val 

CTC 


Ser 
ACT 


Leu 
CTT 


Hie 

CAT 


Ala 

CCT 


Val 

CTT 


Cly 
CCT 


Val 
CTA 


Ser 

TCC 


Tyr 
TAC 


Trp 
TGG 


Lya 
aAa 


126 


Ala 

CCT 


Ser 

TCT 


Clu 
CAC 


Cly 
CCA 


Ala 

CCT 


Clu 
CAA 


Tyr 
TAT 


Aip 

CAT 


Asp 
CAT 


Cln 
CAC 


Thr 
ACC 


Ser 
ACT 


Cln 
CAA 


Arg 
ACC 


Clu 
CAC 


Lys 

AAA 


Clu 
CAA 


Asp 

CAT 


144 


Asp 

CAT 


Lys 
AAA 


Val 
CTC 


Phe 
TTC 


Pro 

CCT 


Cly 
CCT 


Cly 
CCA 


S*r 
ACC 


His 

CAT 


Thr 
ACA 


lyr 
TAT 


Val 

CTC 


Trp 
TCC 


Cln 
CAC 


Val 

CTC 


Leu 
CTC 


Lys 

AAA 


Clu 
CAC 


162 


Asn 
AAT 


Cly 
CCT 


Pro 
CCA 


MET 
ATC 


Ala 

CCC 


Ser 

TCT 


Asp 
CAC 


Pro 
CCA 


Leu 
CTC 


Cys 
TCC 


Leu 

CTT 


Thr 
ACC 


Tyr 
TAC 


Ser 

TCA 


Tyr 
TAT 


Leu 
CTT 


Ser 

TCT 


His 
CAT 


180 


Val 
CTC 


Asp 
CAC 


Leu 

CTC 


Val 

CTA 


Ly« 

AAA 


Asp 
CAC 


Leu 

TTC 


Asn 
AAT 


Ser 
TCA 


Cly 
CCC 


Leu 
CTC 


lie 
ATT 


Cly 
CCA 


Ala 

CCC 


Leu 
CTA 


Leu 

CTA 


Val 

CTA 


Cy. 

TCT 


198 


Acg 

ACA 


Clu 
CAA 


Cly 
CCC 


Ser 

ACT 


Leu 

CTC 


Ala 

CCC 


Lya 
AAC 


Clu 
CAA 


Lys 
AAC 


Thr 
ACA 


Gin 
CAC 


Thr 
ACC 


Leu 

TTC 


HI* 

CAC 


Lys 
AAA 


Phe 
TTT 


Tie 
ATA 


Leu 

CTA 


216 


Leu 
CTT 


Ph« 
TTT 


Ala 

CCT 


Val 

CTA 


Phe 
TTT 


Asp 
CAT 


Clu 
CAA 


Cly 
CCC 


Lys 
AAA 


Ser 

ACT 


Trp 
TCC 


His 
CAC 


Ser 
TCA 


Clu 
CAA 


Thr 
ACA 


Lys 
AAC 


Asn 
AAC 


Ser 
TCC 


234 


Leu 
TTC 


MET 
ATC 


Cln 
CAC 


Asp 

CAT 


Arg 
ACC 


Asp 
CAT 


Ala 

CCT 


Ala 

CCA 


Ser 

TCT 


Ala 

CCT 


/'* 

CCC 


AJa 

CCC 


Trp 
TCC 


Pro 
CCT 


Lys 

AAA 


MET 
ATC 


His 
CAC 


Thr 
ACA 


252 


Val 
CTC 


Asn 
AAT 


Cly 

CCT 


Tyr 
TAT 


Val 

CTA 


Aan 

AAC 


Arg 
ACC 


Ser 
1CT 


CTC 


Pro 
CCA 


Cty 
CCT 


Leu 

CTC 


lie 
ATT 


Cly 
CCA 


Cys 
TCC 


Hts 

CAC 


Arg 

ACC 


Lys 

AAA 


270 


Scr 
TCA 


Val 

CTC 


Tyr 
TAT 


Trp 
TCC 


His 
CAT 


Val 
CTC 


lie 
ATT 


Cly 
CCA 


MET 
ATC 


Cly 
CCC 


Thr 
ACC 


Thr 
ACT 


Pro 
CCT 


Clu 
CAA 


Val 

CTC 


His 

CAC 


Ser 
TCA 


lie 
ATA 


288 


Phe 

TTC 


Lue 

CTC 


Clu 
CAA 


Cly 
CCT 


Hla 

CAC 


Thr 
ACA 


Phe 
TTT 


Leu 
CTT 


Val 
CTC 


Arg 
ACC 


Asn 

AAC 


Mis 
CAT 


Arg 

CCC 


Cln 
CaC 


Ala 

CCC 


Ser 
TCC 


Leu 
TTC 


Clu 
CAA 


306 
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TABLE 1 . continued 



lie 
ATC 


Ser 

TCC 


Pro 
CCA 


lie 
ATA 


Thr 
ACT 


Phe 

TTC 


Leu 
CTT 


Thr 
ACT 


Ala 
CCT 


Cilt 
CAA 


Thr 
ACA 


Leu 
CTC 


Leu 
TTC 


MET 
ATC 


Aap 
CAC 


Leu 
CTT 


Cly 
CCA 


Cln 
CAC 


324 


Phe 

TXT 


Leu 
CTA 


Lou 
CTC 


Phe 
TTT 


Cys 
TCT 


Ills 
CAT 


lie 
ATC 


Scr 
TCT 


Ser 
TCC 


His 

CAC 


Cln 
CAA 


His 
CAT 


Asp 
CAT 


Cly 
CCC 


MET 
ATC 


Clu 
CAA 


Ala 
CCT 


Tyr 
TAT 


342 


Val 
CTC 


Lys 
AAA 


Val 
CTA 


Asp 
CAC 


Ser 
ACC 


Cys 
TCT 


Pro 
CCA 


Clu 
CAC 


Clu 
CAA 


Pro 
CCC 


Cln 
CAA 


l.cu 
CTA 


Arg 
CCA 


MET 
ATC 


Ly* 
AAA 


Asn 
AAT 


Asn 
AAT 


Clu 
CAA 


360 


Clu 
CAA 


Al*. 

CCC 


Clu 
CAA 


Asp 

CAC 


Tyr 
TAT 


Asp 

CAT 


Asp 

CAT 


Asp 

CAT 


Leu 
CTT 


Thr 
ACT 


Asp 
CAT 


Ser 
TCT 


Clu 
CAA 


MET 
ATC 


Asp 

CAT 


Val 
CTC 


Val 
CTC 


Arg 
ACC 


378 


Ph« 

TTT 


Asp 

CAT 


Asp 

CAT 


Asp 
CAC 


Asn 
AAC 


Ser 

TCT 


Pro 
CCT 


Ser 
TCC 


Phe 
TTT 


He 
ATC 


Cln 
CAA 


He 

ATT 


Arg 

CCC 


Ser 

TCA 


Val 
CTT 


Ala 

CCC 


Lys 
AAC 


Lys 

AAC 


396 


His 

CAT 


Pro 
CCT 


Lys 
AAA 


Thr 
ACT 


Trp 
TCC 


Val 
CTA 


Kls 
CAT 


Tyr 
TAC 


lie 
ATT 


Aln 
CCT 


Ala 
CCT 


Clu 
CAA 


Clu 
CAC 


Clu 
CAC 


Asp 
CAC 


Trp 
TCC 


Asp 
CAC 


Tyr 
TAT 


414 


Ala 

CCT 


Pro 
CCC 


Leu 
TTA 


Val 

CTC 


Leu 

CTC 


Ala 

CCC 


Pro 
CCC 


Asp 

CAT 


Asp 
CAC 


Arg 
ACA 


Ser 
ACT 


Tyr 
TAT 


Lys 
AAA 


Ser 


Cln 


Tyr 

TAT 


Leu 


Asa 

AAC 


432 


Asn 
AAT 


Cly 
CCC 


Pro 
CCT 


Cln 
CAC 


Arg 

CCC 


lie 
ATT 


Cly 
CCT 


Arg 

ACC 


Lyn 

AAC 


Tyr 
TAC 


Lys 

AAA 


Lys 

AAA 


Val 
CTC 


Arg 

rr a 


Phe 
TTT 


MKT 
ATC 


Ala 

CCA 


Tyr 
tac 


450 


Thr 
ACA 


Asp 
CAT 


Clu 
CAA 


Thr 

ACC 


Phe 

TTT 


Ly. 

AAC 


Thr 
ACT 


Arg 

CCT 


Clu 
CAA 


Ala 

CCT 


lie 
ATT 


Cln 
CAC 


Kls 
CAT 


Clu 
CAA 


Ser 
TCA 


Cly 

CCA 


Ho 
ATC 


L«tu 
TTC 


468 


Cly 
CCA 


Pro 
CCT 


H.u 

TTA 


Leu 

CTT 


Tyr 
TAT 


Cly 
CCC 


Clu 
CAA 


Val 
CTT 


Cly 
CCA 


Asp 

CAC 


Thr 
ACA 


Leu 

CTC 


Leu 

TTC 


He 
ATT 


He 
ATA 


Phe 

TTT 


Lys 

AAG 


Asn 

AAT 


486 


Cln 
CAA 


Ala 

CCA 


Ser 

ACC 


Arg 
ACA 


Pro 
CCA 


Tyr 
TAT 


Asn 
AAC 


lie 
ATC 


Tyr 
TAC 


Pro 
CCT 


His 
CAC 


Cly 
CCA 


He 

ATC 


Thr 
ACT 


Asp 
CAT 


Val 
CTC 


Arg 
CCT 


Pro 
CCT 


504 


Leu 
TTC 


Tyr 
TAT 


Ser 
TCA 


Arg 
ACC 


Arg 
ACA 


Leu 
TTA 


Pro 
CCA 


Lys 
AAA 


Cly 
CCT 


Vnl 
CTA 


Lys 
AAA 


His 
CAT 


Leu 
TTC 


Lys 

AAC 


Asp 
CAT 


Phe 

TTT 
l i i 


Pro 

CCA 

V»^>A 


He 

ATT 
AA J 


522 


Leu 

CTC 


Pro 
CCA 


Cly 
CCA 


Clu 
CAA 


rio 

ATA 


Phe 

TTC 


Lys 
AAA 


Tyr 
TAT 


Lys 
AAA 


Trp 

TCC 


Thr 


Val 

\* i I* 


Thr 


Val 

r*T a 


Clu 
CAA 


Asp 

pit 


Cly 


Pro 
CCA 


540 


Thr 
ACT 


Lys 
AAA 


Ser 
TCA 


Asp 
CAT 


Pro 
CCT 


Arg 

CCC 


Cys 

TCC 


Leu 
CTC 


Thr 
ACC 


Arg 
CCC 


Tyr 
TAT 


Tyr 
TAC 


Ser 
TCT 


Ser 
ACT 


Phe 
TTC 


Val 
CTT 


Asn 

AAT 


MET 
ATC 


558 


Clu 
CAC 


Arg 
ACA 


Asp 

CAT 


Leu 
CTA 


Ala 

CCT 


Ser 

TCA 


Cly 
CCA 


Leu 

CTC 


lie 
ATT 


Cly 

CCC 


Pro 
CCT 


i.eu 
CTC 


Leu 

CTC 


Ho 
ATC 


Cys 
TCC 


Tyr 

TAC 


Ly« 

AAA 


Clu 
CAA 


576 


Ser 
TCT 


Val 
CTA 


Asp 
CAT 


Cln 
CAA 


Arg 
ACA 


Cly 
CCA 


Asn 

AAC 


Cln 
CAC 


He 
ATA 


MLT 
ATC 


S»»r 
TCA 


Asp 
CAC 


Lys 

AAC 


Arg 
ACC 


Asn 
AAT 


Val 
CTC 


lie 

ATC 


Leu 

CTC 


594 


Ph. 

rrr 


Ser 

TCT 


Vnl 
CTA 


Plte 
TTT 


Asp 

CAT 


Cln 
CAC 


Asn 
AAC 


Arg 

CCA 


Scr 

ACC 


Trp 
ICC 


Tyr 
TAC 


Leu 

CTC 


Thr 
ACA 


Clu 
CAC 


Asn 

AAT 


I lc 
ATA 


Cln 
CAA 


Arg 

CCC 


612 


Phe 

TTT 


Lett 
CTC 


Pro 
CCC 


Asn 
AAT 


Pro 
CCA 


Ala 
CCT 


Cly 
CCA 


Val 

ctc 


Cln 
CAC 


L*u 
CTT 


Clu 
CAC 


Asp 

cat 


Pro 
CCA 


Clu 
CAC 


Phe 
TTC 


Cln 
CAA 


Ala 
CCC 


Scr 

TCC 


630 


Asn 
AAC 


lie 
ATC 


MET 
ATC 


His 
CAC 


Ser 
ACC 


lie 
ATC 


Asn 

AAT 


Cly 
CCC 


Tyr 
7AT 


Val 
CTT 


Phe 

TTT 


Afcp 

CAT 


Ser 

ACT 


Leu 

TTC 


Cln 
CAC 


Leu 
TTC 


Str 
TCA 


V«i 
CTT 


648 



5" 
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TABLE 1 , continued 



Cys* 

T/*T 
lis I 


Leu 

lib 


His 

LA i 


Clu 
p if 


Val 

CTC 


Ala 

cc \ 


Tyr 

TAC 
* AL 


Trp 

TCC 
i LL 


Tyr 

Tap 

1 «VL 


lie 

\TT 

All 


Leu 

PT A 
L I A 


Sitr 

ACC 

ALL 


lie 

Ail 


Cly 

t'C A 

LLA 


AJa 
CCA 


Cln 
CAC. 


Thr 
ACT 


Asp 
CAC 


666 


Phe 
TTC 


. Leu 
CTT 


Ser 
TCT 


Vol 
CTC 


Phe 
TTC 


Phe 

TTC 


Ser 

TCT 
4.L A 


c;iy 

CCA 
LLA 


Tyr 
tat 

AAA 


Thr 

ACC 
ALL 


rite 

TTP 
1 iL 


Lys 
AAA 


His 

f * AP 
LAL 


Lys 
AAA 


MET 

A Tl* 

ATL 


Val 
CTC 


Tyr 
TAT 


Clu 
CAA 


684 


Asp 

CAC 


Thr 
ACA 


Leu 
CTC 


Thr 

ACC 


Leu 

CTA 

L 4 A 


Phe 

TTC 
i AL 


Pro 

CC A 
LL A 


Phe 

TTP 
I I L 


Ser 

TCA 
i LA 


Cly 

PP A 
LLA 


Clu 

P A A 

LAA 


Thr 

A/T 

ALT 


Val 

LTL 


Phe 

TTC 


MET 
ATC 


Scr 
TCC 


MET 
ATC 


Clu 
CAA 


702 


Asn 

AAC 


Pro 
CCA 


Cly 

CCT 


Leu 

CTA 


Tr r 

TCC 


lie 

ATT 

Ai i. 


Leu 

CTC 
L 1 L 


Cly 

WW 


Cys 

TCC 
1 LL 


His 

CAP 
LAL 


Asn 

A AP 

AAL 


Scr 

TP A 

I LA 


Asp 

PAP 

LAL 


Phe 

TTT 

ill 


Arg 

ppp 

LLL 


Asn 

A AP 

AAL 


Arg 

A P A 

ALA 


Cly 

cc c~ 
LLL 


720 


MET 
ATC 


Thr 
ACC 


Ala 

CCC 


Leu 

TTA 
X 1A 


Leu 

CTC 


Lys 

AAC 
AAw 


Val 

CTT 
L 1 I 


Ser 

TPT 

iLT 


Scr 

A PT 
AL A 


Cys 

TPT 

iLT 


A*p 

t * A C 

LAL 


Lys 

A \ C 

AAL 


Asn 

A A 

AAL 


Thr 

A /"T 

alt 


Cly 

LLT 


Asp 

CAT 


Tyr 
TAT 


Tyr 
TAC 


738 


CIu 
CAC 


A**p 

CAC 


Ser 

ACT 
AL 1 


Tyr 

TAT 


Clu 

wAA 


Asp 

PAT 
LAX 


He 

ATT 

AAA 


Ser 

rr a 
I LA 


Ala 

f*c a 
LLA 


Tyr 
TAC 


Leu 
TTC 


l.eu 
CTC 


Ser 

ACT 


Ly* 
AAA 


Asn 
AAC 


Asn 
AAT 


Ala 

CCC 


lie 
ATT 


756 


CIu 
CAA 


Pro 
CCA 


Arg 
ACA 


Ser 

ACC 
AUU 


Phe 

TTC 


Scr 

ILL 


Cln 

P A P 

LAL 


Awn 

AAT 

AAA. 


Ser 

Tr a 

TLA 


Arg 

A C A 

ALA 


Mis 
CAC 


Pro 
CCT 


Ser 
ACC 


Thr 
ACT 


Arg 
ACC 


Cln 
CAA 


Lys 

AAC 


Cln 
CAA 


774 


Phe 
TTT 


Aen 

AAT 


Ala 

CCC 


Thr 

ACC 
ALL 


Thr 

ACA 
ALA 


lie 

ATT 
Ai i 


Pro 

PP A 
LLA 


Clu 

Pa a 
LAA 


Asn 

» i T 

AAT 


Asp 

/* A C 

CAC 


He 

ATA 


Clu 
CAC 


Ly* 
AAC 


Thr 
ACT 


Asp 

CAC 


Pro 

CCT 


Trp 
TCC 


Phe 

TTT 


792 


Ala 

LLA 


His 

CAC 


Arg 

ALA 


Thr 

ALA 


Pre 

LIT 


MET 

AIL 


Pro 
CCT 


Lya 

AAA 


lie 
ATA 


Cln 
CAA 


Asn 

AAT 


Val 
CTC 


Scr 
TCC 


Ser 
TCT 


Ser 
ACT 


Asp 
CAT 


Leu 
TTC 


Leu 

TTC 


810 


WET 

ATC 

AIM 


Leu 

CTC 
LJ L 


L«u 

ttp 

lib 


Arg 

P/~ A 


Gin 
CAC 


Ser 
ACT 


Pro 
CCT 


Thr 
ACT 


Pro 
CCA 


His 

CAT 


Cly 
CCC 


L*?u 
CTA 


Ser 

TCC 


Leu 

TTA 


Scr 
TCT 


Asp 

CAT 


Leu 

CTC 


Cln 
CAA 


828 


Clu 
CAA 


AJa 
CCC 


Lys 

AAA 
AAA 


Tyr 

TAT 


Clu 

Cap 
LAL 


Thr 

APT 
ALT 


Phe 

\ 

ill 


Ser 
TCT 


Asp 
CAT 


Asp 
CAT 


Pro 
CCA 


Ser 
TCA 


Pro 

CCT 


Cly 
CCA 


Aln 
CCA 


Tie 

ATA 


Asp 
CAC 


Scr 
ACT 


846 


Asn 
AAT 


Asn 
AAC 


Ser 

ACC 


Leu 

CTC 

L A L 


Scr 

TCT 


Clu 

Pa a 
LAA 


MET 

A T#"* 


Thr 
ACA 


His 
CAC 


Phe 
TTC 


Arg 
ACC 


Pro 
CCA 


Cln 
CAC 


Leu 
CTC 


His 
CAT 


His 
CAC 


Scr 
ACT 


Cly 

CCC 


864 


Asp 

CAC 


MET 
ATC 


Val 
CTA 


Phe 

TTT 


Thr 
ACC 


Pro 
CCT 


Clu 
CAC 


Ser 
TCA 


Cly 
CCC 


Leu 
CTC 


Cln 
CAA 


Leu 
TTA 


Are 
ACA 


Leu 
TTA 


Asn 

AAT 


Clu 

CAC 


Lys 
AAA 


Leu 

CTC 


882 


Cly 
CCC 


Thr 
ACA 


Thr 
ACT 


Ala 

CCA 


Ala 

CCA 


Thr 
ACA 


Clu 
CAC 


Leu 

TTC 


Ly* 
AAC 


Lys 
AAA 


Leu 
CTT 


Asp 

CAT 


The 
TTC 


Lys 

AAA 


Val 
CTT 


jur 
TCT 


Ser 

ACT 


Thr 
ACA 


900 


Ser 
TCA 


Asn 
AAT 


Aen 
AAT 


Leu 

CTC 


lie 
ATT 


Ser 
TCA 


Thr 
ACA 


lie 
ATT 


Tro 
CCA 


Ser 

TCA 


Asp 
CAC 


Asn 
AAT 


l.eu 
TTC 


Ala 
CCA 


Ala 

CCA 


Cly 
CCT 


Thr 
ACT 


Asp 

CAT 


918 


Asn 
AAT 


Thr 
ACA 


Ser 

ACT 


Ser 
TCC 


Leu 
TTA 


Cly 

cca 


Pro 
CCC 


Fro 
CCA 


Ser 
ACT 


MET 
ATC 


Pro 
CCA 


Val 
CTT 


His 
CAT 


Tyr 
TAT 


Asp 
CAT 


Ser 

ACT 


Cln 
CAA 


Leu 
TTA 


936 


Asp 
CAT 


Thr 
\CC 


Thr 
ACT 


Leu 
CTA 


Phe 
TTT 


Cly 
CCC 


Lys 
A,\A 


Lya 
AAC 


Scr 
TCA 


Ser 

TCT 


Pro 
CCC 


Leu 
CTT 


Thr 
ACT 


Clu 
CAC 


Ser 
TCT 


Cly 
CCT 


Cly 
CCA 


Pro 
CCT 


954 


L«u 

CTC 


Ser 
ACC 


Leu 
TTC 


Ser 
ACT 


Clu 
CAA 


Clu 
CAA 


Asn 
AAT 


Asn 
AAT 


Asp 
CAT 


Ser 

TCA 


l.ys 
aa<; 


Leu 

TTC 


Lc-i. 
TTA 


Clu 
CAA 


Ser 

TCA 


Cly 
CCT 


Leu 

TTA 


MET 
ATC 


972 


Asn 
AAT 


Ser 
ACC 


CJn 
CAA 


Clu 
CAA 


Ser 

ACT 


Ser 
TCA 


Trp 
TCC 


Cly 
CCA 


Lye 
AAA 


Asn 
AAT 


Vsl 

CTA 


Ser 

TCC 


Ser 
TCA 


Thr 
ACA 


Clu 

CAC 


Ser 
ACT 


Cly 
CCT 


Arg 
ACC 


990 




TABLE 1 . continued 



Leu 
TTA 


Phe 
TTT 


Lys 
AAA 


Cly 
CGC 


Lys 
AAA 


Arg 
ACA 


A In 
CCT 


His 
CAT 


Cly 
CCA 


Pro 
CCT 


Leu 
TTA 


Phe 
TTC 


Lya 
AAA 


Val 
CTT 


Scr 
ACC 


lie 
ATC 


Ser 
TCT 


Lfcii 
TTC 


Leu 
TTA 


Lys 

AAC 


Ala 
CCA 


TUr 
ACT 


Asn 
AAT 


Arg 
ACA 


Lys 
- AAC 


Thr 
ACT 


His 
CAC 


lie 
ATT 


Asp 
CAT 


Cly 
CCC 


Pro 
CCA 


Ser 
TCA 


Vol 
CTC 


Trp 
TCC 


Cln 
CAA 


Asn 
AAT 


ile 
ATA 


Leu 
TTA 


Clu 
CAA 


Ser 
ACT 


Pro 
CCT 


Leu 

TTC 


lie 
ATT 


His 
CAT 


Asp 

CAC 


Arg 
ACA 


MET 
ATC 


Leu 
CTT 


MET 
ATC 


Asp 
CAC 


Asn 
AAT 


His 
CAT 


MET 
ATC 


Ser 

TCA 


Asn 
AAT 


Lys 
AAA 


Thr 
ACT 


Thr 
ACT 


Scr 
TCA 


Scr 
TCA 


Lys 
AAA 


Lys 
AAA 


Clu 
CAC 


Cly 
CCC 


Pro 
CCC 


lie 
ATT 


Pro 
CCA 


Pro 
CCA 


Asp 
CAT 


Ala 

CCA 


Lya 
AAC 


MET 
ATC 


Leu 

CTA 


Phe 
TTC 


Leu 
TTC 


Pro 

CCA 


Clu 
CAA 


Ser 
TCA 


Ala 
CCA 


Arg 

ACC 


Asn 
AAC 


Ser 
TCT 


Leu 

crc 


Asn 
AAC 


Ser 
TCT 


Cly 
CCC 


Cln 
CAA 


Cly 
CCC 


Pro 

CCC 


Sctr 
ACT 


Pro 

CCA 


Clu 
CAA 


Lys 
AAA 


Ser 

TCT 


Val 

CTC 


Clu 
CAA 


Cly 
CCT 


Cln 
CAC 


Asn 

AAT 


Phe 
TTC 


Val 
CTA 


Cly 
CCA 


Lys 
AAC 


Cly 
CCT 


Clu 
CAA 


Die 
TTT 


Thr 
ACA 


Lys 
AAC 


Asp 
CAC 


Val 
CTA 


Scr 
AGC 


Scr 
ACC 


Arg 
ACA 


Asn 
AAC 


Leu 
CTA 


Phe 
TTT 


Leu 
CTT 


Thr 
ACT 


Asn 
AAC 


Leu 
TTC 


His 
CAC 


Asn 
AAT 


Cln 
CAA 


Clu 
CAA 


Lys 
AAA 


Lys 
AAA 


llQ 

ATT 


Cln 
CAC 


Clu 
CAA 


Clu 
CAA 


Cln 
CAA 


Clu 
CAC 


Asn 
AAT 


Val 
CTA 


Val 
CTT 


Leu 
TTC 


Pro 
CCT 


Cln 
CAC 


He 

ATA 


His 

CAT 


HET 
ATC 


Lys 
AAC 


Asn 
AAC 


Leu 
CTT 


Phe 
TTC 


Leu 
TAA 


Leu 
CTC 


Ser 

ACC 


Thr 
ACT 


Arg 
ACC 


Cly 
CCC 


Ala 
CCA 


Tyr 
TAT 


Ala 
CCT 


Pro 
CCA 


Val 
CTA 


Leu 
CTT 


Cln 
CAA 


. Asp 

CAT 


Phe 
TTT 


Arg 
ACA 


Thr 
ACA 


Lys 

AAC 


Lys 
AAA 


His 
CAC 


TUr 
ACA 


Ala 
CCT 


His 
CAT 


Phe 

TTC 


Scr 
TCA 


Clu 
CAA 


Cly 
CCC 


Leu 

TTC 


Cly 
CCA 


Asn 
AAT 


Cln 
CAA 


Thr 

ACC 


Lys 

AAC 


Cln 
CAA 


He 

ATT 


Arg 
ACC 


lie 

ATA 


Ser 

TCT 


Pro 
CCT 


Asn 
AAT 


Thr 
ACA 


Ser 
ACC 


Cln 
CAC 


Cln 
CAC 


Asn 
AAT 
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Ala 
CCT 


Lett 
TTC 


l.cti 
TTC 


xnr 
ACT 


Lys 
AAA 


A:;p 
CAT 


As n 
AAT 


A t» 1,008 

AIM ' 

CCC 


Thr 
ACA 


At.n 
AAC 


Lys 
AAA 


i ii r 
ACT 


TCC 


Asn 
AAT 


Asn 
AAT 


«. r 1,026 
TCA 


i ro 
CCA 


C — v 

TCA 


Leu 
TTA 


Leu 
TTA 


r l » 
l ie 

ATT 


C 1 it 
will 

CAC 


A Aft 

nin 

AAT 


ACT 


Asp 
CAC 


k nr 
ACT 


CAC 


i tie 
TTT 


Lys 
AAA 


Lys 
AAA 


» a i 
CTC 


Thr 1 06 2 

ACA 


Lys 
AAA 


Asn 
AAT 


Aia 
CCT 


i ur 
ACA 


Aia 

CCT 


Leu 

TTC 


Arg 
ACC 


l-„ 1 , 080 
Leu ~ 

CTA 


Lys 
AAA 


Asn 

ACC 


nr-T 
ATC 


c iu 
CAA 


PI Li 
ATC 


val 
CTC 


c in 
CAA 


Clll 1 , 098 
cin ' 

CAC 


Cln 
CAA 


Asn 
AAT 


Pro 
CCA 


Asp 
CAT 


riLT 
ATC 


Ser 

ICC 


rne 
TTC 


PK« 1 116 
r nc -* » a a. a 

TTT 


Tip 
TCC 


t ie 
ATA 


CAA 


Arg 

ACC 


i nr 
ACT 


Ufa 

CAT 


Cl> 
CCA 


i v-« 1 134 

L> S a. , j. j i 

AAC 


ITO 

CCA 


uys 
AAC 


C 111 
CAA 


LCU 

TTA 


vai 
CTA 


C a v 

aer 

TCC 


LCU 

TTA 


r1 „ 1 , 152 
CCA 


Leu 
TTC 


TCT 


C J U 

CAC 


Lys 
AAA 


Asn 
AAC 


L>s 
AAA 


val 
CTC 


Vn i 1.170 

Vfl X a. , a. / w 

CTA 


Cly 
CCA 


Lett 
CTC 


Lys 
AAA 


r\,t 
CAC 


UfT 

riLi 
ATC 


val 
CTT 


i ue 
TTT 


Pro 1.188 
r TO Af*w»w 

CCA 


ASp 

CAT 


A^n 
AAT 


Leu 
TTA 


HIS 

CAT 


C1U 
CAA 


Asn 
AAT 


Asn 
AAT 


TKr- 1 206 
ACA 


lie 
ATA 


Clu 
CAA 


Lys 
AAC 


Lys 
AAC 


c* lu 
CAA 


Thr 
ACA 


Leu 

TTA 


lie J. f h 
ATC 


Thr 
ACA 


Val 
CTC 


Thr 
ACT 


Cly 
CCC 


Thr 
ACT 


Lys 
AAC 


Asn 
AAT 


Phe 1,242 
TTC 


Cln 
CAA 


Asn 
AAT 


Val 
CTA 


Clu 
CAA 


cly 

CCT 


S«r 
TCA 


Tyr 
TAT 


Clu 1, 260 

CAC 


Arg 

ACC 


Ser 
TCA 


Leu 
TTA 


Asn 
AAT 


Asp 
CAT 


Scr 
TCA 


Thr 
ACA 


Asn 1,278 
AAT 


Lys 
AAA 


Lys 
AAA 


Cly 
CCC 


Clu 
CAC 


Clu 
CAA 


Clu 
CAA 


Asn 
AAC 


Leu 1,296 
TTC 


Val 
CTA 


Clu 
CAC 


Lys 
AAA 


Tyr 
TAT 


Ala 

CCA 


Cys 
TCC 


Thr 
ACC 


Thr 1,314 
ACA 


Phe 
TTT 


Val 

CTC 


Thr 
ACC 


Cln 
CAA 


Ar R 
CCT 


Ser 
ACT 


Ly* 
AAC 


Arg 1,332 

ACA 



7 
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TABLE 1 , continued 



Ala 
CCT 


Leu 
TTC 


Lys 
AAA 


Cln 
CAA 


Phe 
TTC 


Arr, 
ACA 


Leu 
CTC 


Pro 
CCA 


Leu 
CTA 


Clu 
CAA 


Clu 
CAA 


Thr 

ACA 


Clu 
CAA 


Leu 

CTT 


Clu 
CAA 


Lys 

AAA 


Arg 
ACC 


He 
ATA 


1,350 


lie 
ATT 


Val 
f.TC 


Asp 
CAT 


Asp 

CAC 


Thr 
ACC 


Ser 
TCA 


Thr 
ACC 


Cln 
CAC 


Trp 
TCC 


Ser 
TCC 


Lys 
AAA 


Asu 
AAC 


MKT 
ATC 


Lys 
AAA 


His 
CAT 


l.CU 

TTC 


Thr 
ACC 


Pro- 
CCC 


1,368 


Ser 

ACC 


Thr 
ACC 


Leu 
CTC 


Thr 
ACA 


CJn 
CAC 


Tlo 
ATA 


Asp 
CAC 


Tyr 
TAC 


Asn 
AAT 


Clu 
CAC 


Lys 
AAC 


Clu 
CAC 


Lys 
AAA 


Cly 
CCC 


Ala 
CCC 


IJe 
ATT 


Thr 
ACT 


Cln 
CAC 


1,386 


Scr 
TCT 


Pro 
CCC 


Leu 

TTA 


Ser 
TCA 


Asp 
CAT 


Cy. 

TCC 


Leu 

CTT 


Thr 
ACC 


Arg 
ACC 


Ser 

ACT 


His 
CAT 


Ser 
ACC 


He 

ATC 


Pro 
CCT 


Cln 
CAA 


Ala 
CCA 


Asn 

AAT 


Arg 
ACA 


1,404 


Scr 

TCT 


Pro 
CCA 


Leu 

TTA 


Pro 

CCC 


lie 

ATT 


Ala 

CCA 


Lys 
AAC 


Val 
CTA 


Ser 
TCA 


Ser 
TCA 


Phu 
TTT 


Pro 
CCA 


Ser 

TCT 


Ilo 
ATT 


Arg 
ACA 


Pro 
CCT 


lie 
ATA 


Tyr 
TAT 


1,422 


Leu 

CTC 


Thr 
ACC 


Arg 
ACC 


Val 
CTC 


Leu 

CTA 


Phe 

TTC 


Cln 
CAA 


Asp 

CAC 


Asn 
AAC 


Ser 
TCT 


Ser 
TCT 


His 

CAT 


Leu 
CTT 


Tro 
CCA 


Ala 

CCA 


Ala 

CCA 


Scr 
TCT 


Tyr 
TAT 


1,440 


Arg 
ACA 


Lys 
AAC 


Lys 
AAA 


Asp 
CAT 


Ser 
TCT 


Cly 
CCC 


Val 
CTC 


Cln 
CAA 


Clu 
GAA 


Ser 
ACC 


Ser 
ACT 


Hie 
CAT 


Phe 
TTC 


Leu 
TTA 


Cln 
CAA 


Cly 
CCA 


Ala 
CCC 


Lys 

AAA 


1,458 


Ey. 
AAA 


Asn 
AAT 


Asn 
AAC 


Leu 

CTT 


Ser 
TCT 


Leu 
TTA 


Ala 

CCC 


lie 
ATT 


Leu 
CTA 


Thr 
ACC 


Leu 
TTC 


Clu 

CAC 


MET 
ATC 


Thr 
ACT 


Cly 
CCT 


A?*p 
CAT 


Cln 
CAA 


Arr, 
ACA 


1,476 


Clu 
CAC 


Val 
CTT 


Cly 

ccc 


Ser 

TCC 


Leu 
CTC 


Cly 
CCC 


Thr 
ACA 


Scr 
ACT 


Ala 

CCC 


Thr 
ACA 


Asn 
AAT 


Scr 
TCA 


Val 
CTC 


Thr 
ACA 


Tyr 
TAC 


Lys 

AAC 


• Ly. 
AAA 


Val 
CTT 


1,494 


Clu 
CAC 


Asn 
AAC 


Thr 
ACT 


Val 
CTT 


Leu 

CTC 


Pro 

CCC 


Lys 
AAA 


Pro 
CCA 


A*p 

CAC 


Leu 
TTC 


Pre 
CCC 


Lys 
AAA 


Thr 
ACA 


Ser 
TCT 


Cly 
CCC 


Ly* 

AAA 


Val 
CTT 


Clu 
CAA 


1,512 


Leu 

TTC 


Leu 

CTT 


Pro 
CCA 


Lys 
AAA 


Val 
CTT 


His 
CAC 


lie 
ATT 


Tyr 
TAT 


Cln 
CAC 


Lys 

AAC 


Asp 

CAC 


Leu 
CTA 


Plte 

TTC 


Pro 

CCT 


Thr 

ACC 


Clu 
CAA 


Thr 
ACT 


Ser 

ACC 


1, 530 


Aan 
AAT 


Cly 
CCC 


Ser 
TCT 


Pro 
CCT 


Cly 
CCC 


His 
CAT 


Leu 
CTC 


Asp 

CAT 


Leu 
CTC 


Val 
CTC 


Clu 
CAA 


Cly 
CCC 


Ser 
ACC 


Leu 
CTT 


Leu 

err 


Cln 
CAC 


Cly 
CCA 


Thr 
ACA 


1,548 


Clu 
CAC 


Cly 
CCA 


Ala 

CCC 


lie 

ATT 


Ly. 

AAC 


Trp 
TCC 


Asn 
AAT 


Clu 
CAA 


Ala 
CCA 


Asn 

AAC 


Arg 
ACA 


Pro 
CCT 


Cly 
CCA 


Lys 
AAA 


Val 
CTT 


Pro 

CCC 


Phe 

TTT 


Leu 
CTC 


1,566 


Arg 
ACA 


Val 
CTA 


Ala 

CCA 


Thr 
ACA 


Clu 
CAA 


Ser 
ACC 


Ser 

TCT 


Ala 
CCA 


Lys 
AAC 


Thr 
ACT 


Pro 
CCC 


Scr 
TCC 


Lvs 

AAC 


Leu 
CTA 


TTC 


Asn 

/vsp 
CAT 


Pro 
CCT 


Leu 
CTT 


1, 584 


Ala 

CCT 


Trp 
TCC 


Asp 

CAT 


Asn 
AAC 


His 

CAC 


Tyr 
TAT 


Cly 
CCT 


Thr 
ACT 


Cln 
CAC 


lie 
ATA 


Pro 
CCA 


Lys 
AAA 


Clu 
CAA 


Clu 
CAC 


Trp 

TCC 


Lys 
AAA 


Ser 

TCC 


Cln 
CAA 


1,602 


Clu 
CAC 


Ly. 

AAC 


Ser 
TCA 


Pro 
CCA 


Clu 
CAA 


Lys 
AAA 


Thr 
ACA 


Ala 
CCT 


Phe 
TTT 


Lys 
AAC 


Lys 

AAA 


Lys 
AAC 


Asp 
CAT 


Thr 
ACC 


lie 
ATT 


Leu 
TTC 


s«r 

TCC 


Leu 

CTC 


1, 520 


Aan 
AAC 


Ala 
CCT 


cy« 

TCT 


Clu 
CAA 


Ser 

ACC 


Asn 
AAT 


His 
CAT 


Ala 

CCA 


He 
ATA 


Ala 
CCA 


Ala 

CCA 


Ik 
ATA 


A.-n 
AAT 


Clu 
CAC 


Cly 
CCA 


Cln 
CAA 


Asn 
AAT 


Lys 

AAC 


1,638 


Pru 
CCC 


Clu 
CAA 


lie 
ATA 


Clu 
CAA 


Val 
CTC 


Thr 
ACC 


Trp 
TCC 


Alii 
CCA 


Lys 
AAC 


Cln 
CAA 


Cly 
CCT 


Arg 

ACC 


Thr 
ACT 


Clu 
CAA 


Arg 

ACC 


Leu 
CTC 


Cys 
TCC 


Scr 
TCT 


1,656 


Cln 
CAA 


A^n 
AAC 


Pro 
CCA 


Pro 
CCA 


Vnl 

CTC 


t.eu 
TTC 


Lys 
AAA 


Arg 

CCC 


His 
CAT 


Cln 


Arg 


Clu 
CAA 


He 
ATA 


Thr 
ACT 


Arg 
CCT 


Thr 
ACT 


Ihr 

ACT 


Leu 
CTT 


1,674 
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TABLE 1 . continued 



Cin 
CAC 


Scr 
TCA 


Asp 
CAT 


Cin 
CAA 


Clu 
CAC 


Clu 
CAA 


lie 
ATT 


Asp 
CAC 


Tyr 
TAT 


Asp 

CAT 


Asp 
CAT 


Thr 
ACC 


IU 
ATA 


Ser 
TCA 


Val 
CTT 


Clu 
CAA 


MET 
ATG 


Lys 
AAC 


1,692 


Lys 

AAC 


Clu 
CAA 


Asp 

CAT 


The 
TTT 


Asp 
CAC 


He 
ATT 


Tyr 
TAT 


Asp 
CAT 


Clu 
CAC 


Asp 
CAT 


Clu 
CAA 


Asn 
AAT 


Cin 
CAC 


Scr 
ACC 


Pro 
CCC 


Arg 

CCC 


Scr 
ACC 


Phe 
TTT 


1,710 


Cin 
CAA 


Ly. 
AAC 


Lys 
AAA 


Thr 
ACA 


Arg 

CCA 


His 
CAC 


Tyr 
TAT 


Phe 

TTT 


He 

ATT 


Ala 
CCT 


Ala 
CCA 


Val 
CTC 


Clu 
CAC 


Arg 
ACC 


Leu 

CTC 


Trp 
TCC 


Aap 
CAT 


Tyr 
TAT 


1,728 


Cly 
CCC 


MET 
ATC 


Ser 
ACT 


Ser 

ACC 


Ser 

TCC 


Pro 
CCA 


HI. 

CAT 


Val 

CTT 


Leu 
CTA 


Arg 
ACA 


Asn 
AAC 


Arg 
ACC 


Ala 
CCT 


Cin 
CAC 


Scr 

ACT 


Cly 
CCC 


Ser 

ACT 


Val 
CTC 


1,746 


Pro 
CCT 


Cin 
CAC 


Phe 

TTC 


Ly. 

AAC 


Lys 

AAA 


Val 
CTT 


Val 

CTT 


Phe 

TTC 


Cin 
CAC 


Clu 
CAA 


Phe 
ITT 


Thr 
ACT 


Asp 

CAT 


Cly 
CCC 


Scr 
TCC 


Phe 

TTT 


Thr 
ACT 


Cin 
CAC 


1,764 


Pro 

CCC 


Leu 
TTA 


Tyr 
TAC 


Arg 

CCT 


Cly 
CCA 


Clu 
CAA 


Leu 

CTA 


A*n 
AAT 


Clu 
CAA 


His 
CAT 


Leu 
TTC 


Cly 
CCA 


Leu 

CTC 


Leu 
CTC 


Cly 

CCC 


Pro 

CCA 


Tyr 
TAT 


lie 

ATA 


1,782 


Arg 
ACA 


Ala 

CCA 


Clu 
CAA 


Val 
CTT 


Clu 
CAA 


Asp 
CAT 


Asn 
AAT 


He 
ATC 


MET 
ATC 


Val 
CTA 


Thr 
ACT 


Phe 
TTC 


Arg 
ACA 


Asn 
AAT 


Cin 
CAC 


Ala 

CCC 


Ser 

TCT 


Arg 
CCT 


1,800 


Pro 
CCC 


Tyr 
TAT 


Ser 
*TCC 


Phe 
TTC 


Tyr 
TAT 


Ser 
TCT 


Ser 
ACC 


Leu 

CTT 


11* 
ATT 


Sor 
TCT 


Tyr 
TAT 


Clu 
CAC 


Clu 
CAA 


Asp 
CAT 


Cin 
CAC 


Arg 
ACC 


Cin 
CAA 


Cly 
CCA 


1,818 


AU 

CCA 


Clu 
CAA 


Pro 
CCT 


Arg 
ACA 


Ly. 
AAA 


Asn 
AAC 


Phe 

TTT 


Val 

CTC 


Lys 

AAC 


Pro 
CCT 


Asn 
AAT 


Clu 
CAA 


Thr 
ACC 


Lys 
AAA 


Thr 
ACT 


Tyr 
TAC 


Phe 
TTT 


Trp 
TCC 


1,836 


Ly. 
AAA 


Val 
CTC 


Cin 
CAA 


Hit 

CAT 


His 
CAT 


MET 
ATC 


Ala 

CCA 


Pro 
CCC 


Thr 
ACT 


Lys 
AAA 


Asp 

CAT 


Clu 
CAC 


Phe 

TTT 


Asp 

CAC 


cys 
TCC 


Ly. 
AAA 


Ala 

CCC 


Trp 
TCC 


1,854 


Ala 
CCT 


Tyr 
TAT 


Phe 
TTC 


Ser 
TCT 


Asp 
CAT 


Val 
CTT 


Asp 

CAC 


Leu 

CTC 


Clu 
CAA 


Lys 

AAA 


Acp 
CAT 


Val 
CTC 


His 
CAC 


Ser 
TCA 


Cly 
CCC 


Leu 
CTC 


He 
ATT 


Cly 
CCA 


1,872 


Pro 
CCC 


Leu 

CTT 


Leu 

CTC 


Val 
CTC 


Cys 
TCC 


His 
CAC 


Thr 
ACT 


A.n 

AAC 


Thr 
ACA 


Leu 

CTC 


Asn 
AAC 


Pro 

CCT 


Ala 
CCT 


Kls 
CAT 


Cly 
CCC 


Arg 
ACA 


Cin 
CAA 


Val 
CTC 


1,890 


Thr 
ACA 


Val 
CTA 


Clu 
CAC 


Clu 
CAA 


Phe 
TTT 


Ala 
CCT 


Leu 
CTC 


Phe 

TTT 


Phe 
TTC 


Thr 
ACC 


lie 
ATC 


Phe 
TTT 


Asp 
CAT 


Clu 
CAC 


Thr 
ACC 


Lys 
AAA 


Scr 
ACC 


Trp 

TCC 


1,908 


Thy 
TAC 


Phe 

TTC 


Thr 
ACT 


Clu 
CAA 


Asn 

AAT 


MET 
ATC 


Clu 
CAA 


Arg 
ACA 


Asn 
AAC 


Cy. 
TCC 


Arg 
ACC 


Ala 

CCT 


Pro 
CCC 


Cy. 

TCC 


Asn 

AAT 


He 
ATC 


Cin 
CAC 


MET 
ATC 


1,926 


Clu 
CAA 


Asp 
CAT 


Pro 

CCC 


Thr 
ACT 


Phe 

TTT 


Lys 
AAA 


Clu 
CAC 


Asn 
AAT 


Thr 
TAT 


Arg 
CCC 


Phe 
TTC 


His 
CAT 


Ala 
CCA 


lie 

ATC 


Asn 
AAT 


Cly 
CCC 


Tyr 
TAC 


lie 
ATA 


1,944 


HFTT 
ATC 


Asp 
CAT 


Thr 
ACA 


Leu 
CTA 


Pro 
CCT 


Cly 

CCC 


Leu 
TTA 


Val 
CTA 


MET 
ATC 


Ala 

CCT 


Cin 
CAC 


Asp 
CAT 


Cin 
CAA 


Arg 

ACC 


He 
ATT 


Arg 

CCA 


Trp 
TCC 


Tyr 
TAT 


1,962 


Leu 

CTC 


Leu 
CTC 


Ser 
ACC 


MET 
ATC 


Cly 
CCC 


Ser 
ACC 


Asn 
AAT 


Clu 
CAA 


Asn 
AAC 


llv 
ATC 


Ills 
CAT 


Ser 

TCT 


IU 

ATT 


HI. 

CAT 


Phe 
TTC 


Ser 

ACT 


Cly 

a: A 


His 

CAT 


1,980 


Val 
CTC 


Phe 

TTC 


Thr 
ACT 


Val 

CTA 


Arg 

CCA 


Ly* 

AAA 


Ly. 
AAA 


Clu 
CAC 


Clu 
CAC 


Tyr 
TAT 


Lys 
AAA 


MET 
ATC 


Al4 

CCA 


Leu 
CTC 


Tyr 
TAC 


Asn 
AAT 


l.eu 
CTC 


Tyr 
TAT 


1,998 


Pro 
CCA 


Cly 
CCT 


VaI 
CTT 


Phe 
TTT 


Clu 
CAC 


Thr 
ACA 


Val 
CTC 


Clu 
CAA 


MKT 
ATC 


Lou 
TTA 


Tro 
CCA 


S«r 

rcc 


Lys 
AAA 


Ala 
CCT 


Cly 
t;r.A 


He 
ATT 


Trp 

TCC 


Arg 

CCC 


2,016 



9 
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TABLE 1 , continued 



Val 
CTC 


Clu 
CAA 


Cys 
TCC 


Lau 

CTT 


lie 
ATT 


Cly 
CCC 


Clu 
CAC 


Htm 

CAT 


Leu 
CTA 


tils 
CAT 


Ala 

CCT 


Cly 

CCC 


MPT 
ATC 


Ser 
ACC 


Thr 
ACA 


Leu 

err 


Phe 
TTT 


Leu 
CTC 


2,034 


Val 

t;TC 


Tyr 
TAC 


Ser 
ACC 


Aan 

AAT 


Lys 
AAC 


Cys 
TCT 


Clu 
CAC 


Thr 
ACT 


Pro 

CCC 


Leu 

CTC 


Cly 
CCA 


MLT 
ATC 


Ala 

CCT 


Ser 
TCT 


Cly 
CCA 


His 

CAC 


lie 
ATT 


Arg 
ACA 


2,052 


Asp 

CAT 


Pha 

TTT 


Cln 
CAC 


He 

ATT 


Thr 
ACA 


Ala 

CCT 


Scr 
TCA 


Cly 
CCA 


Cln 
CAA 


Tyr 
TAT 


Cly 
CCA 


Cln 
CAC 


Trp 

TCC 


Ala 

CCC 


Pro 
CCA 


Lys 
AAC 


Leu 
CTC 


Ala 

CCC 


2,070 


Arg 

ACA 


L«u 

CTT 


His 

CAT 


Tyr 
TAT 


Ser 

TCC 


Cly 
CCA 


Ser 

TCA 


tie 

ATC 


Asn 
AAT 


Ala 
CCC 


Trp 

TCC 


Ser 

ACC 


Thr 
ACC 


Lys 
AAC 


Clu 
CAC 


Pro 
CCC 


Phe 

TTT 


Ser 

TCT 


2,088 


Trp 
TCC 


lie 
ATC 


Lys 
AAC 


Val 

CTC 


Asp 

CAT 


Lau 
CTC 


Lau 

TTC 


Ala 

CCA 


Pro 
CCA 


MET 
ATC 


He 

ATT 


He 

ATT 


His 

CAC 


Cly 
CCC 


lie 

ATC 


Lye 
AAC 


Thr 
ACC 


Cln 
CAC 


2,106 


Cly 
CCT 


Ala 

CCC 


Arg 

CCT 


Cln 
CAC 


Lys 

AAC 


Phe 
TTC 


Sar 

TCC 


Ser 
ACC 


Leu 
CTC 


Tyr 
TAC 


lie 

ATC 


Ser 
TCT 


Cln 
CAC 


Phe 

TTT 


Tie 
ATC 


lie 

ATC 


MET 
ATC 


Tyr 
TAT 


2,124 


Ser 

ACT 


Leu 
CTT 


Asp 

CAT 


Cly 
CCC 


Ly. 

AAC 


Lys 

AAC 


Trp 
TCC 


Cln 
CAC 


Thr 
ACT 


Tyr 
TAT 


Arg 
CCA 


Cly 
CCA 


Asn 

AAT 


Ser 

TCC 


Thr 
ACT 


Cly 
CCA 


Thr 
ACC 


Leu 
TTA 


2,142 


MET 
ATC 


Val 
CTC 


Phe 

TTC 


Ph* 

TTT 


Cly 
CCC 


Asn 
AAT 


Val 
CTC 


Asp 
CAT 


Ser 
TCA 


Ser 
TCT 


Gly 
CCC 


tie 

ATA 


Lys 
AAA 


His 

CAC 


Asn 
AAT 


lie 

ATT 


Phe 
TTT 


Asn 
AAC 


2,160 


Pro 

CCT 


Pro 
CCA 


11* 

ATT 


I la 

ATT 


Ala 

CCT 


Arg 

CCA 


Tyr 
TAC 


lie 

ATC 


Arg 

CCT 


Leu 
TTC 


His 

CAC 


Pro 
CCA 


Thr 
ACT 


His 

CAT 


Tyr 
TAT 


Ser 

ACC 


tie 
ATT 


Arg 

CCC 


2,178 


S«r 

ACC 


Thr 
ACT 


Leu 
CTT 


Arg 

CCC 


MET 
AIC 


Clu 
CAC 


Lau 

TTC 


MET 
ATC 


Cly 
CCC 


Cys 
TCT 


Asp 
CAT 


Leu 
TTA 


Asu 
AAT 


Sor 
ACT 


Cys 

TCC 


Sf r 

ACC 


MET 
ATC 


Pro 

CCA 


2, 196 


Leu 

TTC 


Cly 
CCA 


MET 
ATC 


Clu 
CAC 


Ser 
ACT 


Lys 
AAA 


Ala 

CCA 


He 
ATA 


Ser 
TCA 


Asp 
CAT 


Ala 

CCA 


Cln 
CAC 


tic 
ATT 


Thr 
ACT 


Ala 

CCT 


Ser 

TCA 


Ser 
TCC 


Tyr 
TAC 


2, 214 


Phe 
ITT 


Thr 
ACC 


Asn 
AAT 


MET 
ATC 


Pha 

TTT 


Ala 

CCC 


Thr 
ACC 


Trp 
TCC 


Sar 

TCT 


Pro 

CCT 


Ser 
TCA 


Lys 
AAA 


Ala 

CCT 


Arg 

CCA 


Leu 
CTT 


His 
CAC 


Leu 
CTC 


Cln 
CAA 


2,232 


Cly 
CCC 


Arg 
ACC 


Ser 

ACT 


Asn 


Ala 


Trp 


Arg 

AUA 


Pro 
CCT 


Cln 
CAC 


Val 
CTC 


Asn 
AAT 


Asn 
AAT 


Pro 
CCA 


Lys 
AAA 


Clu 
CAC 


Trp 
TCC 


Leu 

CTC 


Cln 
CAA 


2,250 


Val 
CTC 


Asp 

CAC 


Phe 

TTC 


Cln 
CAC 


Lya 

AAC 


Thr 
ACA 


MET 
ATC 


Lys 

AAA 


Val 

CTC 


Thr 
ACA 


Cly 
CCA 


Val 

CTA 


Thr 
ACT 


Thr 
ACT 


Cln 
CAC 


Cly 
CCA 


Val 
CTA 


Lys 
AAA 


2,268 


Ser 

TCT 


L«u 

CTC 


Leu 

CTT 


Thr 
ACC 


Ser 

ACC 


MET 
ATC 


Tyr 
TAT 


Val 

CTC 


Lys 
AAC 


Clu 
CAC 


Phe 

TTC 


Leu 

CTC 


11* 

ATC 


Sar 
TCC 


Ser 

ACC 


Scr 
ACT 


Cln 
CAA 


Asp 
CAT 


2,286 


Cly 
CCC 


His 
CAT 


Cln 
CAC 


Trp 
TCC 


Thr 
ACT 


Lau 

CTC 


Phe 

TTT 


Phe * 
TTT 


Cln 
CAC 


Asn 
AAT 


Cly 
CCC 


Lys 
AAA 


Vnl 
CTA 


Lys 
AAC 


Val 
CTT 


Phe 
TTT 


Cln 
CAC 


Cly 
CCA 


2, 304 


Asn 
AAT 


Cln 
CAA 


Asp 
CAC 


Ser 
TCC 


Pha 

TTC 


Thr 
ACA 


Pro 
CCT 


Val 
CTC 


Val 
CTC 


Asn 
AAC 


Sor 
TCT 


Leu 
CTA 


Asp 

CAC 


Pro 
CCA 


Tro 

ccc; 


Leu 

TTA 


Leu 

CTC 


Thr 
ACT 


2,322 


Arg 

CCC 


Tyr 
TAC 


L«u 

CTT 


Arg 

CCA 


Tta 

ATT 


His 

CAC 


Pro 

CCC 


Cln 
CAC 


Sar 
ACT 


Trp 

TCC 


Val 

CTC 


His 

CAC 


Cln 
CAC 


Tie 
ATT 


Ala 

ccc 


Leu 
CTC 


Ar« 
ACC 


MKT 
ATC 


2,340 


Clu 
CAC 


Val 

CTT 


Lau 

CTC 


Cly 
CCC 


Cys 
TCC 


Clu 
CAC 


Ala 

CCA 


Cln 
CAC 


Asp 
CAC 


Leu 

CTC 


Tyr 
TAC 


End 
TCA 


CCCTCCCCACTCCATCCCACCTCCCACTC 


2,352 



CCCTCACCTCTCCCTCCTCAOTCCACCCCATCTCTCCCTCCCTCC 

A^CTCCTGAATTAACTATCATCACTCCTCC^m 

TTCTCCACCTCCTCCCACA 
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1000 followed by the amino acid sequence of Asp-158 2 to Argj- 
17 08. That compound thus comprises the polypeptide sequence 
of Ala-2 0 to Pro-1000 covalently linked by a peptide bond to 
amino acids Asp-1582 to Tyr-2351. Another exemplary compound 
5 contains a region X comprising the amino acid sequence Ser-760 
to Thr-778 followed by the sequence Pro-1659 to Arg-1708. 
That compound thus comprises the polypeptide sequence Ala-2 0 
to Thr-778 covalently linked by a peptide bond to the sequence 
Pro-1659 through Tyr-2351. Still another exemplary compound 
10 contains a region X comprising the amino acid sequence Ser-7 60 
to Thr-778 followed by the sequence Glu-1694 to Arg-1708. 
That compound thus comprises the polypeptide sequence Ala-2 0 
to Thr-778 covalently linked by a peptide bond to amino acids 
Glu-1694 through Tyr-2351. 



15 These exemplary compounds are depicted schematically in 



The amino acid sequence represented by X should be selected 
so that it does not substantially reduce the procoagulant 
20 activity of the molecule, which activity can be conveniently 
assayed by conventional methods. Compound (2) of Table 2 is a 



The procoagulant protein may be produced by appropriate host 
25 cells transformed by factor VIII :C DNA which has been specific- 
ally altered by use of any of a variety of site-specific muta- 
genesis techniques which will be familiar to those of ordinary 
skill in the art of recombinant DNA. 

30 The starting materials may be a DNA sequence which codes for 
the complete factor VIII :C molecule, e.g., the complete human 
factor VIII :C as shown in Table 1, a truncated version of that 
sequence, or it may comprise segments of that DNA sequence, so 
long as the starting materials contain at least sufficient DNA 

35 to code for the amino acid sequences of the desired polypeptide. 




Table 2. 



presently preferred embodiment. 
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The proooagu - l e fit -proteins of the present invention , in addition 
to lacking ~a> substantial amino acid segment of human factor 
VIII :C, also have fewer potential N-glycosylation sites than 
human factor VIII. Preferably, at least one. N-glycosylation site 
5 has been deleted* More preferably, 18 of the 25 potential 
N-glycosylation sites are not in the molecule. In still more 
preferred embodiments, up to 19 of the 25 potential N-glycos- 
ylation sites are removed. While not wishing to be bound by 
theory, it is presently believed that the antibodies to factor 
10 VIII :C which are directed to antigenic determinants contained 
in the protein segment deleted in accordance with this inven- 
tion, i.e., in the amino acid segement itself or in the carbo- 
hydrate portion of the glycosylated protein, will not neutralize 
the procoagulant proteins of the present invention. Moreover, 
15 the fact that the procoagulants of the present invention lack 
many of the sites for non-human glycosylation by the non-human 
mammalian or other cells used to produce the proteins is also 
belived to reduce the antigenicity of that protein, and lessen 
the likelihood of developing antibodies to the procoagulants. 
20 This may enable facilitating the treatment of patients in need 
of procoagulant therapy. 

I contemplate that my compounds can be produced by recombinant 
DNA techniques at a much lower cost than is possible for pro- 
25 duction of human factor VIII. The host organisms should more 
efficiently, process and express the substantially simpler 
molecules of this invention. 

The compounds of this invention can be formulated into pharmaceu- 
30 tically acceptable preparations with parenterally acceptable 
vehicles and excipients in accordance with procedures known 
in the art. 

The pharmaceutical preparations of this invention, suitable for 
35 parenteral administration, may conveniently comprise a sterile 
lyophilized preparation of the protein which may be reconsti- 





tuted by addition of sterile solution to produce solutions 
preferably isotonic with the blood* of the recipient. The prepar- 
ation may be presented in unit or multi-dose containers, 
e.g. in sealed ampoules or vials. Their use would be analogous 
5 to that of human factor VIII, appropriately adjusted for potency. 

One method by which these proteins can be expressed is by use 
of DNA which is prepared by cutting a full-length factor 
VIII :C DNA with the appropriate restriction enzymes to remove 
10 a portion of the DNA sequence that codes for amino acids 760 
to 1708 of human factor VIII :C. The cut DNA is then ligated 
with an oligonucleotide that resects the cut DNA and maintains 
the correct translational reading frame. 

15 Preparation of the cDNA has been set forth in detail in U.S. 
Patent Applications Serial Nos. 54 6,650 and 644,08 6, supra . A 
pSP64 recombinant clone containing the nucleotide sequence 
depicted in Table 1, designated as pSP64-VIII, is on deposit 
at the American Type Culture Collection under Accession Number 

20 ATCC 39812. 

Restriction endonucleases are used to obtain cleavage of the 
human factor VIII :C cDNA, hereinafter the DNA source sequence, 
at appropriate sites in the nucleotide sequence. Unless 

25 otherwise noted, restriction endonucleases are utilized under 
the conditions and in the manner recommended by their commercial 
suppliers. The restriction endonucleases selected herein are 
those which will enable one to excise with substantial speci- 
ficity sequences that code for the portion of the factor 

30 VIII :C molecule desired to be excised. BamHI and SacI are 
particularly useful endonucleases. However, the skilled 
artisan will be able to utilize other restriction endonucleases 
chosen by conventional selection methods. The number of 
nucleotides deleted may vary but care should be taken to 

35 insure that the reading frame of the ultimate cDNA sequence 
will not be affected. 
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The resulting DNA fragments are then purified using conventional 
techniques such as those set forth in Maniatis et al., Molecular 
Cloning, A Laboratory Manual (Cold Spring Harbor Laboratory 1982) 
the disclosure of which is incorporated herein by reference, and 

5 Proc. Natl. Acad, Sci. 76:615-619 (1979). The purified DNA is 

/ ' JtL 

then ligated to form the sequence encoding the polypeptide of 
the preferred invention. When necessary or desirable, the 
ligation may be within an oligonucleotide that resects the cut 
DNA and maintains the correct translational reading frame 
10 using standard ligation conditions. Ligation reactions are 
carried on as described by Maniatis et al., su pra at 2453-6 
using the buffer described at page 246 thereof and using a DNA 
concentration of i^ 100 ug/ml, at a temperature of 23 °C for 
blunt ended DNA and 16 °C for "sticky ended" DNA. The following 



a; 



15 double-stranded oligonucleotide is useful when there is BamHI/- 
SacI deletion such as described infra , 

fS 

"7"^ 3 -TCGAGTACCTGGCCTAG 5"; f?^ 



Tx 



5 1 P-CATGGACCG-3 • 



0 but other oligonucleotides can be selected by the skilled artisan 
depending upon the deletions made and reaction conditions. 



The DNA sequences encoding the novel procoagulant polypeptides 
can, in addition to other methods, be derived from the sequence 
25 of human factor VIII :C DNA by application of oligonucleotide-med- 
iated deletion mutagenesis, often referred to as "loopout" muta- 
genesis, as described for example inMorinaga, Y. et al. Biotech- 
nology. . 2: 636-639 (1984). 1 
/ flf 

30 The new DNA sequences containing the various deletions can then 
be introduced into appropriate vectors for expression in 
mammalian cells. The procoagulant activity produced by the 
transiently transfected or stably transformed host cells may 

35 be measured by using standard assays for blood plasma samples. 
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The eukaryotic cell expression vectors described herein may be 
synthesized by techniques well known to those skilled in this 
art. The components of the vectors such as the bacterial 
replicons, selection genes, enhancers, promoters, and the like 
5 may be obtained from natural sources or synthesized by known 
procedures. See Kaufman et al., J. Mol. Biol. . 159: 51rr521 



(1982) ; Kaufman, Proc, Natl. Acad. Sci. 82: 689-693 (1985) 



10 suitable as hosts. Normal diploid cells, cell strains derived 
from in vitro culture of primary tissue, as well as primary 
explants (including relatively undifferentiated cells such as 
haematopoeitic stem cells) are also suitable. Candidate cells 
need not be genotypically deficient in the selection gene so 

15 long as the selection gene is dominantly acting. 

The host cells preferably will be established mammalian cell 
lines. For stable integration of the vector DNA into chromosomal 
DNA, and for subsequent amplification of the integrated vector 

20 DNA, CHO (Chinese hamster ovary) cells are presently preferred. 
See U.S. Patent 4,399,216. Alternatively, the vector DNA 
could include all or parts of the bovine papilloma virus 
genome (Lusky et al., Cell , 36 : 391-401 (9184) and be carried 
in cell lines such as'ci27 mouse bells as a stable episomal 

25 element. Other usable mammalian cell lines include HeLa, 
COS-1 monkey cells, melanoma cell lines such as Bowes cells, 
mouse L-929 cells, 3T3 lines derived from Swiss, Balb-c or NIH 
mice, BHK or HaK hamster cells lines and the like. 

30 Stable transf ormants then are screened for expression of the 
procoagulant product by standard immunological or enzymatic 
assays. The presence of the DNA encoding the procoagulant 
proteins may be detected by standard procedures such as Southern 
blotting. Transient expression of the procoagulant genes 

35 during the several days after introduction of the expression 




Established cell lines, including transformed cell lines, are 
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vector DNA into suitable host cells such as COS-1 monkey cells 
is measured without selection by enzymatic or immunologic 
assay of the proteins in the culture medium. 

5 The invention will be further understood with reference to the 
following illustrative embodiments, which are purely exemplary, 
and should not be taken as limiting the true scope of the present 
invention, as described in the claims. 



15 



20 



25 



30 
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EXAMPLE 1 

10 ug. of the plasmid pACE, a pSP64 (Promega Biotec, Madison, 
Wis.) derivative, containing nucleotides 562-7269 of human factor 
VIII :C cDNA (nucleotide 1 is the A of the ATG initiator meth- 
5 ionine codon) was subjected to partial BamHI digestion in 
lOOul containing 50mM Tris.HCl ph 8.0, 50mM MgCl 2/ and 2.4 
units BamHI (New England Biolabs) for 30 minutes at 37 °C. The 
reaction was terminated by the addition of EDTA to 2 0mM and 
then extracted once with phenol, once with chloroform , ethanol 
10 precipitated and pelleted by centrifugation. DNA was redis- 
solved, cleaved to completion in 50ul using 4 0 units SacI for 
1.5 hours at 37 °C. DNA was then electrophoresed through a 
buffered 0.6% agarose gel. An 8.1 kb fragment corresponding 
to the partial BamHI-SacI fragment of pACE lacking only the 
15 sequence corresponding to nucleotides 2992-4774 of the factor 
VIII :C sequence was purified from the gel using the glass 
powder technique described in Proc. Nat. Acad. Sci. 76 ; 615,^619 
(1979) . Purified DNA was ligated with 100 pmoles of the following 
double-stranded oligonucleotide 

5 1 P-CATGGACCG-3 • 

T~X 3 1 " TCGAGTACCTGGCCTAG 5 1 l^S 



20 



f 



using standard ligation conditions. The DNA sequence removed 
represents the deletion of 584 amino acid sequence beginning 
25 with amino acid 998 and continuing through 1581. The oligo- 
nucleotide inserted, however, encodes amino acids corresponding 
to 99^-1000. Therefore, the polypeptide encoded contains 
deletion of 581 amino acids. 

30 DNA was then used to transform competent E_;_ coli bacteria, and 
DNA from several ampicillin resistant transf ormants was analyzed 
by restriction mapping to identify a plasmid harboring the 
desired SacI-BamHI deletion mutant. DNA from this plasmid was 
digested to completion with Kpnl, which cleaves the plasmid 
35 uniquely at nucleotide 1816 of the factor VIII :C coding se- 
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quence. This DNA was ligated with a Kpnl DNA fragment containing 
nucleotides 1-1815 of factor VIII :C DNA and a synthetic Sail 
site at nucleotides -11 to -5 and then used to transform 
competent E. coli bacteria. ^ 



Plasmid DNA was isolated and oriented by restriction mapping to 
identify a plasmid, pBSdK, containing the correct 5 1 to 3 1 orien- 
tation of the Kpnl insert. Sail digestion, which excises the 
entire polypeptide coding region from the plasmid, was performed 

10 and the DNA electrophoresed through a buffered 0.6% agarose gel. 
The 5.3Kb Sail fragment was purified from the gel as described 
above. This DNA fragment was ligated with Xhol cut pXMT2 DNA 
to give rise to plasmid pDGR-2. pXMT2 is a plasmid capable of 
expressing heterologous genes when introduced into mammalian 

15 cells such as the COS-1 African Green Monkey kidney cell line, 
and is a derivative of the expression vectors described in 
Kaufman, supra at 6 8^-93. The expression elements are the 
same as described for plasmid pQ2 except that it contains a 
deletion of the adenovirus major late promoter extending from 

2Qjj"4 5 to +156 with respect to the transcription start site of 
the adenovirus major late promoter. mRNA expression in pXMT 
is driven by "the SV40 late promoter. The bacterial replicon, 
however, has been substituted to render bacteria containing 
the vector resistant to ampicillin rather than tetracycline. 

25 PXMT2 contains a unique Xho I site at a position which allows 
for expression of inserted cDNA from the SV40 late promoter. 
This Xho I site is convenient for inserting factor VIII :C cDNA 
constructs since these are flanked by Sail sites. 

30 Restriction mapping of transf ormants identified a plasmid, 
pDGR-2, containing the correct 5 1 to 3' orientation of the 
polypeptide coding sequence relative to the direction of 
transcription from the SV4 0 late promoter. pDGR-2 is on deposit 
at the American Type Culture Collection under Accession number 



35 53100. 
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EXAMPLE 2 



f 



Other novel procoagulant proteins may be obtained from constructs 
produced by oligonucleotide mediated deletion mutagenesis, using 
5 for example the "loopout" mutagenesis techniques as described 
in Morinaga et al. , supra . The deletion mutagenesis is performed 
using expression plasmid pDGR-2 or any other appropriate plasmid 
or bacteriophage vector. Other methods for oligonucleotide 
mediated mutagenesis employing single stranded DNA produced with 
10 M13 vectors and the like are also suitable. See Zoller et al., 
Nucl . Acids Res. JLO.: 648*^-6500 (1982) . For example, these 
deletions can be produced u^ing the oligonucleotides j^y 

I (A) 5* AAAAGCAATTTAATGCCACCCCACCAGTCTTGAAACGCCA /^P 

*-^Jl5 (B) 5» AAAAGCAATTTAATGCCACCGAAGATTTTGACATTTATGA /^f~ 



to cause deletions in factor VIII :C cDNA from nucleotides (A) * 

JL? 3 !^- 0 49 7 "—~ (B ^ 2334 to 5079 • The Proteins encoded by these 
constructs contain deletions of (A) 880 and (B) 915 amino acids 
20 relative to Factor VIII :C. 
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The deleted constructs are tested directly, or after subcloning 
into appropriate expression vectors, in order to determine if 
the novel proteins possess procoagulant activity. Procoagulant 
25 activity was assayed as described in Examples 3 and 4 . 
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EXAMPLE 3 

Expression of Procoagulant Molecules in COS Monkey Cells 



30 The expression plasmids containing the modified cDNA's prepared 
as in Examples 1 or 2 and the full-length cDNA, "pXMT-VIII, 
were introduced into COS-1 cells via the DEAE-dextran trans- 
fection protocol. Sompayrac and Dana 1981, Proc. Natl. Acad. 

78: 757^-^7578. Conditioned media was harvested 48 hours 

35 post-transf ection and assayed for factor VHI-type activity as 

described in Toole et. al., 1984, Nature 312:342-347. The 

/ jf 



. 20 



20 

results of the experiment are summarized in Table 3. Both 
plasmids containing the modified cDNAs yielded procoagulant 
activity and, moreover, the activity was greater than that 
obtained using wild type cDNA. From these data it was concluded 
5 that removal of up to 880 amino acids (95,000 daltons) in a 
defined domain of human factor VIII does not destroy cofactor 
activity. Furthermore, these abridged procoagulant proteins 
retain their ability to be activated by thrombin. 
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TABLE 3 : EXPRESSION OF ABRIDGED FACTOR VIII MOIiECULES 



# amino chromogenic Clotek 

acids activity activity 

plasmid deleted fmUml " 1 ) -Ila +IIa (fold) 

No DNA - 0 

10 

pXMT-VIII - 15:1 450 



pDGR-2 581 114 250 5750 (23X) 

15 pLA-2 880 162 * 330 9240 (28X) 



The plasmids indicated were transfected into COS cells and 48 
hr. post-transf ection the conditioned media taken for assay by 
20 the Kabi Coatest factor VIII :C method (chromogenic activity) 
and by the one-stage activated partial thromboplastin time 
(APTT) coagulation assay (Clotek activity) using factor VIII :C 
deficient plasma as described (Toole, Nature 1984) . For 
thrombin (Ila) activation, samples were j/retrea^ed 1t£L0 min, 
25 with 0.2 units/ml thrombin (Ila) at room temperature . Activation 
coefficients are provided in parentheses. Activity from media 
from the wild-type (pXMT-VIII) transfection was too low to 
directly measure Clotek activity before thrombin activ- 
ation. From other experiments where the wild type factor 
30 VIII activity was concentrated, it was demonstrated to be 
approximately 3 0-fold activatable. 
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EXAMPLE 4 

Expression of Procoagulant Molecules in CHO Cells 

^ £A) Expression of pDGR-2 

The procoagulant expression vector containing a deletion 
(relative to the Factor VIII :C cDNA) of 581 amino acids (pDGR-2) 
was transfected with plasmid pAdD26SV(A) #3 (10 ug pDGR-2 : 1 ug 
pAdD2 6SV(A) #3) by CaP0 4 coprecipitation into CHO DHFR deficient 
10 cells (DUKX-B11) and transf ormants isolated and grown in 
increasing concentrations of MTX as described by Kaufman et. al. , 
(1985) . One transformant designated Jl exhibited the following 
activities as a function of resistance to increasing concen- 
trations of MTX. 
L5 



-T7M-1/1 tr uM WTX mUnits/ml/dav/10 6 cells * 
lOlJOA o 1.46 
0.02 322 
0.1 499 
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(B) Expression of pLA-2 



f 



The procoagulant expression vector containing a deletion of 
880 amino acids (pLA-2) was introduced into CHO DHFR deficient 
25 cells (DUKX-B11, Chasin and Urlaub, PNAS 77: 421&^220, 1980 
by protoplast fusion as described (Sandri-Goldin et. al., Mol. 
Cell. Biol. 1: 74^752). After fusion, fresh medium containing 
100 ug/ml of kanamycin, and 10 ug/ml of each of thymidine, 
adenosine, deoxyadenosine, penicillin, and streptomycin and 
30 10% dialyzed fetal calf serum was added to each plate. The 
kanamycin was included to prevent the growth of any bacteria 
which had escaped conversion to protoplasts. Four days later 
the cells were subcultured 1:15 into alpha-media with 10% 
dialyzed fetal calf serum, penicillin, and streptomycin, but 
35 lacking the nucleosides. Colonies appeared after 10 Ij?- 2 day ^ 
after subculturing cells into selective media. A group of & 
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trans formants were pooled and grown in sequentially increasing 
concentrations .of MTX starting at 0.02 uM with steps to 0.1, 
p 0.2, and 1.0 uM MT5£j. Results of factor VHI-type activity in 
cells resistant to increasing concentrations of MTX is shown 
s below. 



uM MTX mUnits/ml/dav/10 6 cells * 

ToViQK o i6 

0.02 530 

10 0-2 1170 

1.0 1890 

* Factor VIII activity was determined by the Kabi Coatest 
factor VIII :C method (chromogenic activity). 
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